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ABSTRACT 

Two classes of gamma-ray bursts (GRBs), short and long, have been determined with¬ 
out any doubts, and are usually ascribed to different progenitors, yet these classes 
overlap for a variety of descriptive parameters. A subsample of 46 long and 22 short 
Fermi GRBs with estimated Hurst Exponents (HEs), complemented by minimum 
variability time-scales (MVTS) and durations (Tgo) is used to perform a supervised 
Machine Learning (ML) and Monte Carlo (MC) simulation using a Support Vector 
Machine (SVM) algorithm. It is found that while Tgo itself performs very well in dis¬ 
tinguishing short and long GRBs, the overall success ratio is higher when the training 
set is complemented by MVTS and HE. These results may allow to introduce a new 
(non-linear) parameter that might provide less ambiguous classification of GRBs. 
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1 INTRODUCTION 

Gamma-ray bursts (GRBs) are highly energetic events that 
often are brighter than all other 7-ray objects visible on 
the sky combined, with an emiss ion peak in the 200-500 
keV region (for recent reviews, see Nakar 2007 ' L Zhangj 20111 
Gehrels fe Razzaauel 120131 : iBerger 120141 : IMeszaros fc Reed 
20151 ). GRBs were de t ected by military satellites Vela in late 
1960’s. iMazets et alJ (| 198lh first observed a bimodal distri¬ 
bution of T90 (time during which 90% of the burst’s fluence is 
accumulated) drawn for 143 events detected in the KONUS 
experiment. BATSE o nboard the C ompton Gamma Ray 
Observatory (CGR O) llMeegan et al.l 11992)4 allow ed to con¬ 
firm the hypothesis dKlebesadeL Strong fc Olsonll 19731 ) that 
GRBs are of extragalactic origin due to isotropic angular 
distribution in the sky. However, a more complete sample of 
BATSE short GRBs were shown to be distributed anisotrop- 
ica lly and cosmological con sequences were discussed lately 
by IMeszaros fe Reesl (|2015h . BATSE IB data release was 
followed by further inv estigation of the Tgo distribution 
(|Kouveliotou et ahll 19931 ) that lead to establishing the com¬ 
mon classification of GRBs into short (T90 < 2 s) and long 
(T90 > 2 s), and based on which GRBs are most commonly 
classified. The progenitors of lon g GRBs are associated with 
supernovae (IWooslev fc Blooml 120061 1 related with collapse 
of massive stars. Progenitors of short GRBs are thought to 
be NS-NS or NS-BH mergers (lNa.ka.Tll2007h . and no connec- 
tion between short GRBs and supernovae has been proven 
llZhang et alJl2009h . 

Despite initial isotropy, short GRBs were shown to be 
distributed anisotropically on the sky, while long GRBs are 
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distributed isotropically (|BaIazs^4eszajx)S_j£MIoryathil99^; 
Ma gliocchetti. Ghirlnnda^__Celotti |2003|; IVavrek et al.l 
2008) (see also IMeszaros fc Stocekl 120031 for an analysis 
that revealed anisotropy in a set of long GRBs). Since 
long a possibility that GRBs may be divided into more 
than two classes was put forward as there are GRBs that 
do n o t fall easily into one class or another ([Gehrels et al 


120061: IZhang et al.l 120071 : iBromberg et al.l 120131 '). Horvath 


mg e 

19981. 2002h in vestigated GRBs from the BATSE catalog 


( Meegan et al.l 1 19961 : IPaciesas et al.l 1 19991 ') by a univariate 
approach and concluded that the probability that the peak 
at intermediate values of logTgo is a result of random fluc¬ 
tuations is much less than 1%. Also in Swift data evidence 
for a t hir d component in lo g T90 was found ( Horvathet_ah| 
2008allbl: IZhang et al.l 120071: iHui a. Meszaros fc Rfpal 120091 : 


Huia fc 6,m a|[2CI0^ Horvath et alJl2010h . Other dat asets, i.e. 
RHESSI i ftfpa et alJl2009l . l2012l i . or BeppoSAX (iHorvathl 
l2009h , also show evidence for an intermediate class. It is 
worth to note that the interm ediate GRB s are distr i buted 
anisotropically on the sky IjM eszaros. Bagol v fc Vavrekl 
120001 : IMeszaros et al.l l200d : Vavrek et al.l 20081 ). The origin 
and existence of an intermediate class remains elusive as 
theoretical models still need to accou nt for an apparent 
bimodality in duration distribution (Ja niuk et al.l 120061 : 
lNakanl2007f ). Finally, not only the existence of an interme¬ 
diate class was investigated (and remains unsettled), but 
also subclass cl assifications of long GRBs were proposed 
llGao. Lu fc Zhansll2010l) . 

The research conducted on Fermi data llGruber et al.l 


l2014l:lvon Kienlin et al. 


20141) are consiste nt with a bimodal 


duration distribution (Zhang et al. 1 2012: as well as wit h 
the existence of an intermediate class ( Horvath et al. IB); 
however, even a bimodal structure was not present in some 
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energy bands in the examined sample of 315 GRBs (not to 
mention trim odality) an d may be due to an instrumental se¬ 
lection effect jQin et al.ll2013 ). It is important to note that 
in all mentioned research only a mixture of standard normal 
distributions (Gaussians) was fitted to the observed log Tgo 
distributions. By examining the duration distribution it was 
shown that the thir d cl ass is u nlikely to be present in the 
Fermi data llTarnopo lski 2015aj), and that a mixture of two 
intrinsically skewed distributions follows better the logTgo 
distribution of Fermi GR Bs than a mixture of three stan¬ 
dard Gaussians (iTarnopo lski 2015bl). 

The Fermi dataset has been examined widely for 
other reasons, too. Its re dshift distribution was investigated 
llAckermann et ahl 120131) confirming the observation that 
short GRBs have lower redshift s than long GRBs. The Am- 
ati co rrelation was investigated llBasak fe Radl2013l : I Gruber! 
120121 ) a nd a link between sh ort and long GRBs was dis¬ 
covered llMuccino et alj 20131). It gave insig ht into the GRB 
afterglow population ( Racusin et al.l 2011). allowed to ob¬ 
serve a number of high-energy GRBs (with ph oton e nergie s 
exceeding 100 MeV, or even 10 GeV) (I Atwood et al.ll2013l) . 
and provided a ve rification of the short-long classification 
iZhang et al.l[2012 ). 

Other parameters, despite Tgo durations (e.g., similarly 
defined T50 or hardness ratios for various bands) were used 
to classify GRBs for different satellite databases as well. 
A distinction in the hardness ratio i /32 of all three classes 
was f ormulated early (jMukheriee et allllOOSl : [ihjrvgjjLj'LjdJ 
200j). Minimum v ariabili ty time -scale (MVTS) I Bhatl 20131 : 


MacLachlan et al. 1 I2012I. l2013aHbh was used as a distinc¬ 


tive parameter for differentiating GRB lightcurves based 
on their temporal structure. It is a non-param etric feature , 
such as the Hurst exponent ( HE, denoted H) iHurs H ll95ll: 
IMandelbrot fc van Nesslll968i ). which is a quantitative mea¬ 
sure of the persistence of the signal, i.e. it reveals a temporal 
trend in the overall behavior in the data, and was suggested 
to be applicable in GRB classifications (|MacLachlan et all 
l2013bl) . 

Among many existing computati onal algorithms for es¬ 
timat ing H (Rescaled Range Analysis (IMandelbrot fc W allis 
196£), Detrended Fluctua tion A nal ysis IPeng et a l. 1994 . 
199E ), wavelet approach JSimonsen L j^apgen_& 1 Ne^ , 1£ 


Detrended Moving Average ( Allessio et al • 2002 I ). to men¬ 
tion only a few), their extraction for the Fermi GRB sam¬ 
ple was performed by the wavelet appro ach with the Haar 
wavele t as a basis (for further details, see lMacLachlan et al.1 
l2013bl and Sect. 12.31 herein for a brief description). It was 
found that there is an offset between means of H distribu¬ 
tions of long and short GRBs that may serve as a criterion 
for distinguishing these two classes. Moreover, it was pro¬ 
posed that the overlap region of HE distribution may be 
related to the third class of GRBs (possibly intermediate). 
However, the region where short and long GRBs overlap is 
significant due to a large dispersion in H, and the union 
distribution is skewed toward values less than 0.5. 

Additional parameters have been defined and pro¬ 
posed for GRB classification as well. Examples are 
e = ^ 7 ,iso, 52 /E^l 2 (where E-y, iso, 52 is the isotropic gamma- 
ray energy given in 10 52 erg, and E PtZ) 2 is the cosmic 
rest-frame spectral peak energy in units of 100 keV) 
unam biguousl y dividing short and long GRBs !Lii_et al.1 
l20ld) . MVTS dBhatll2013l: [MacLachlan et al.ll2012U2013al lbl: 


Golkhou fe Butler _ 20141: Golkhou. Butler fe Littleiohnsl 

20151) or HE I MacLachlan et al.l2013bl ). However, providing 


a set of parameters that could classify GRBs with high 
accuracy remains a desired objective. Unsupervised machine 
learning (ML) algorithms, trained on a dataset containing 
these parameters, yield a possibility of attaining this goal. 
Before that, performance of supervised ML with arbitrary 
number of classes needs to be tested. 

I 11 this work, focus is laid on performance of HEs, com¬ 
plemented by MVTS (both being wavelet-based parameters 
new in GRB classification) and Tgo, being a classical GRB 
type indicator, in classifying short and long GRBs. This pa¬ 
per is organized in the following manner. In Sect. [2] a descrip¬ 
tion of the data is provided. I 11 Sect. [3] a statistical analy¬ 
sis of the available dataset of GRBs with computed HEs 
is conducted. In Sect. [4] ML is applied to (77, MVTS, Tgo) 
singles, pairs and triples, and a Monte Carlo simulation pro¬ 
vides statistics of the success ratio. Section [5] contains dis¬ 
cussion of the results, and Sect.[6]gives concluding remarks. 
A computer algebra system Mathematica® vl0.0.2 is used 
throughout this paper. 


2 THE SAMPLE 


2.1 Selection criteria 


The data are taken from MacLachlan et_akj_j [201 3a b). 
and are complemented by Golkhou fc Butler! 1 20141 ): 
iGolkhou. Butler fe Littleiohnsl ( 20151 ). They consist of 46 


long and 22 short GRBs detected by t he Fermi satellite. 
According to iMacLachlan et ahl (l2013al ) , the selection cri¬ 
teria fo r the GRB sample examined in IMacLachlan et ahl 
(l2013bh are two-fold. The following condition on the ratio 
of variances for one or more octaves j, 




preburst 


< 0.75, 


( 1 ) 


was required, where ^P reburBt corresponds to a certain time 
interval before the burst]]], used as a surrogate for the back¬ 
ground variance, and /?!? urst expresses the variability of the 
burst. The variances were computed according to 


A = 


1 

rij 


rij-l 

y ] \dj,k 1 , 


( 2 ) 


where cfy*, are the wavelet transform coefficient (for further 
details, see MacLach lan et al.ll2013bl and Sect. l2.3l herein for 
a brief explanation). Additionally, it was required that the 
lightcurve fits had a deduced < 2. 


2.2 Duration Tgo 

The duration Tgo is the time during which the cumulative 
counts increase from 5% to 95% above background, encom¬ 
passing 90% of the total fluence detected. This way of mea¬ 
suring the duration is independent of the intensity for a given 


1 For long GBs the preburst was defined to start 20 s and finish 
5 s before the trigger, and for short GRBs these were 15 s and 1 s, 
respectively. 
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Figure 1. Distributions of logTgo: dashed red - short GRBs, 
dotted blue - long GRBs. 
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Figure 2. Distributions of HEs: dashed red - short GRBs, dotted 
blue - long GRBs. 


instrument (iKouveliotou et allll 9931 ). The detection is trig¬ 
gered when the signal exceeds a certain threshold. 

Fermi has a time resolution of 2/rs; the lightcurves are 
extracted at a binning of 200/ts. Time intervals that are far 
before and far after the main burst are selected as back¬ 
ground and subtracted from the light c urve. The procedure 
is described in detail bv lKoshut et all (119961) . 

The durations used herein come from the Fe r mi GRB 
Burst-Catalogue as cited in iMacLachlan et ah (l2013bl) . 
They are also available in the online catalogu^ The Tgo 
distribution of GRBs examined herein is displayed in Fig. [l] 

2.3 Hurst Exponent 

The HE describes the level of statistical self-similarity of 
a time-series. A time-series X(t) with N data points ti is 
called self-similar (or self-affine) with a Hurst exponent F[ 
if, after a rescaling t —¥ At, it satisfies the following relation 

X(t) = X~ H X(Xt), (3) 

where A denotes equality in distribution. T h e HEs under 
study were calculated by IMacLachlan et all d2013bi) with 
the fast wavelet transform using the Haar wavelet as the 
basis. The basis is obtained from a mother wavelet, , by 

'Pit) -t ’ ( 4 ) 

where j represents the octave (time-scale) and k the position 
of the wavelet. The wavelet transform coefficient is defined 
as 

— (Xjipj’k') • (5) 

By inserting Eq. [3] into Eq. [5] one gets the relation be¬ 
tween the variance of the wavelet coefficients d :h k and the 
scale j: 

log 2 var(dy*,) = (2 FI + 1 )j + const. (6) 

The HE is obtained by fitting a line to the linear part of the 
log 2 var (dj. k) vs. .7 relation. For furthe r details, the reader is 
referred to IMacLachlan et al.l d2013bl ). 

2 http://heasarc.gsfc.nasa.gov/W3Browse/fermi/fermigbrst.html 


The HE is bounded in the interval [0,1] and is equal 
to 0.5 for a random walk (Brownion motion). For persistent 
data H > 0.5, while for anti-persistent FI < 0.5. Regular 
(periodic or quasi-periodic) time series posses H = 1. It is 
related to the fractal dimension of the signal D = 2 — H, pro¬ 
viding information about its statistical self-similarity. This 
means that in a persist e nt pr ocess the increments are persis¬ 
tent themselves dClegsll2006l ) with the same H. A Brownian 
motion has independent increments. 

The HE distribution of GRBs examined herein is dis¬ 
played in Fig. [5] Note there are two small negative values. 
The negative sign most likely comes about because of poor 
quality of the lightcurves, and the features of the lightcurves 
may prevent the algorithm used to converge to a meaning¬ 
ful value (Sonbas, private communication). Most likely these 
HEs are very small, but positive. However, for completeness 
reasons and in order not to further diminish the GRB sam¬ 
ple used, those values are not rejected from the analysis as it 
might happen in future research that other lightcurves may 
also exhibit a negative value in the frame of the method 
applied (in this case, the wavelet approach). Keeping those 
two values takes into account numerical artifacts that might 
be encountered in other GRBs. Nevertheless, as those HEs 
correspond to short GRBs and are consistently placed in the 
HE histogram, they should not affect the short/long classi¬ 
fication. 


2.4 Minimum Variability Time-Scale 

If the lightcurve is binned into too narrow bins, the intrinsic 
variability is lost in the statistical noise. When the binning is 
too coarse, the variability might disappear. Hence, the right 
binning is crucial and varies with the lightcurve. To infer the 
right binning, a comparison of the variances of the GRB and 
the background is performed. A ratio of these variances is 
plotted against the bin-width and the MVTS is the binning 
at which this ratio is minimized (lBhatH2013l) . 

Expressing this concept in other words, MVTS is a time- 
scale at which a transition between white noise and a power 
law is observed in the power density spectrum. This means 
that MVTS marks the time-scale at which a power law pro¬ 
cess and a white noise exchange dominance. As wavelets are 
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Figure 3. Distributions of log MVTS: dashed red - short GRBs, 
dotted blue - long GRBs. 


sensitive to whichever process dominates at a given time- 
scale, they are a natural choice to extract the MVTS from 
the lightcurves. 

The wavelet-based MVTS extraction is carried out sim¬ 
ilarly to the HE estimation described in Sect. 12.31 Starting 
from Eq.[6] with the background subtraction as described in 
Sect. E2J the MVTS is calculated at the octave ^'intersection 
at wchich an intersection of a flat noise spectrum and a lin¬ 
ear relation associated with the power law domain in the 
log-scale diagram occurs: 

MVTS = Tbin x 2 Jinteraeotion , (7) 

where Tbin is the finest binni ng of the data. For further de¬ 
tails, the reader is referred to iMacLachlan et alj d2013al l. 

The MVTS used i n this study are taken from 
IMacLachlan et ahl f 2013al ). although some values for the 
short GRBs subsample were not reported there. Hence, 
the missing value s were re placed with the MVTS from 
iGolkhou fc Buffer! (1201413 : IGolkhou. Butler fc Littlejohns! 
( 201513 . The distribution of the sample under consideration 
is shown in Fig. [3] 


3 INDEPENDENCE OF SHORT AND LONG 
GRBS’ HE DISTRIBUTIONS 

In Fig. m the HE distr i bution s for short and long GRBs 
from IMacLachlan et ahl d2013bl) are reproduced. While the 
systematic difference is clearly visible, the histograms over¬ 
lap strongly. This leads to a question whether they are 
statistically distinguishable. As most of statistical tests 
dKendall fc St uartll l 9731 : iKaniil 120061 : iPress et~ahl 1200713 as¬ 
sume normality, first a Kolmogorov-Smirnov test dMassevI 
|l953) is performed, that resulted in p-values of 0.95 and 
0.87 for short and long GRBs, respectively. These val¬ 
ues are large, so might lead to a conclusion that both 
samples are normally distributed. However, in the sam¬ 
ple of long GRBs, ties are present, and the p -value is 
obtained after ignoring them. A mean of hundred Monte 
Carlo realisations is equal to 0.8735 ± 0.0102, where the er¬ 
ror is the standard deviation of the mean. Another trial 
gave 0.8728 ± 0.0106. To be sure these data are nor¬ 
mally distributed two more tests are performed. The Pear¬ 


son x 2 test dVoinov. Nikulin fc Balakrishnanll2013l3 gave p- 
values equal to 0.75 and 0.81, and the Shapiro-Wilk test 
dShapiro fc Wilk|[l965l 3 gave 0.33 and 0.39 for short and long 
GRBs, respectively. Despite these values span a quite broad 
interval, they are all significant enough to ascertain that the 
HE distributions are normal. 

To check whether they are separate, first a f-test 
((Stvidentl 11908l 3 is performed, based on which at the 0.01 
significance level the hypothesis that the means’ difference 
is /riong — Mshort = 0.163215 cannot be rejected. The means 
were computed from the samp les. Also, the Mann-Whitney 
test dMann fc Whitnevl 1194713 does not allow to reject the 
hypothesis that the medians’ difference is mi ong — m s hort = 
0.144 at the 0.01 significance level. Finally , variance tests: 
Brown-Forsythe dBrown fc Forsvt ~h3 1 1 97l l) . Fisher Ratio 
dPress et al. l20()7l ). and Levene ( Levendll960lh all gave p- 
values greater than 0.9. All of these tests imply that the HE 
distributions indeed are different for short and long GRBs. 


4 MACHINE LEARNING 

As other schemes of classifying GRBs were proposed, e.g. 
based on MVTS, it is desired to investigate their efficiency 
for distinguishing short and long GRBs. As the amount of 
GRBs classified in such a way is still limited, in the following 
section focus is paid only to the two classical classes. 


4.1 Methods 

In o rder to verify whether H c an serve as a GRB type indica¬ 
tor (IMacLachlan et a l. 2013b), a supervised machine learn¬ 
ing (ML) is performed. A support vector machine (SVM) 
algrithm is employed. It aims at building a potentially non¬ 
linear model that is trained over a set of examples that are 
classified as belonging to one of arbitrarily specified cate¬ 
gories (short or long in this case). The training set is repre¬ 
sented in a multi-dimensional space, and the classification is 
done so that the categories are divided by a gap as wide as 
possible. After the SVM is trained, a classification probabil¬ 
ity is derived over the set of applicable parameters. Based on 
this classification, new examples are mapped into the same 
space and predicted to belong to one category or another 
based on which side of the dividing hyperplane they fall on. 
The probability threshold is set at 50%, and no undeter¬ 
mined classifications are allowed. 

The parameter space is three-dimensional, and consti¬ 
tutes of H, MVTS and Tgo. It is found that the number of 
mistaken classifications is reduced when using a space of H, 
log MVTS and log Tgo, and this semi-log space is explored 
throughout this section. A subsample of the total 22 short 
and 46 long GRBs is taken to be a training set, while the 
remainings GRBs serve as a validation set. A subsample is 
created in the following way. For each training set, 20 out of 
22 short GRBs are chosen. The number of these subsets is 
equal to Qo) = 231. Among long GRBs, 42 out of 46 exam¬ 
ples are randomly chosen for each of the short GRBs subsets, 
and this is repeated 435 times. A total of ~ 10 s SVM mod- 
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els is built, and they exhaust the number of realisations of 
short GRBs subset jj. 

ML is done in seven spaces: 

(i) three univariate classifications based on H, log MVTS 
and log X90, 

(ii) three bivariate classifications based on pairs 
(H, log MVTS), (H, logTgo) and (log T 90 , log MVTS), 

(iii) a trivariate classification based on triples 
(H, log MVTS, logT 90 ). 

For each, a success ratio r is inferred from the number of 
correct classifications in the following manner. If both short 
GRBs from the validation set (consisting of two short and 
four long GRBs) were classified correctly by an SVM model, 
the success ratio r s hort is equal to 1; if one is correct and the 
other is false, r s hort = 0.5; and if both were classified falsely 
as long GRBs, r s hort = 0. Similarly for long GRBs, n ong is 
computed, and takes the values 0, 0.25, 0.5, 0.75 or 1. A total 
success ratio is defined as 

2 r s hort T R’lonn . . 

r tot = ---~, (8) 

and the success ratios are calculated for all ~ 10 5 trials. 
The relative frequency for each r is obtained by dividing the 
counts by the number of trials. 


4.2 Results 

4-2.1 ID parameter spaces 

The results of ML for univariate distributions of parameters 
under consideration are displayed in Fig. [4] The HEs, sug¬ 
gested to be a candidate for distinguishi ng between short 
and long GRBs dMacLachlan et alj|2013bl ). exhibit a rather 
poor performance: 57% of short GRBs were classified im¬ 
properly, and correct matches took place for only 5% of 
them. On the other hand, 81% of long GRBs were classi¬ 
fied correctly. This leads to a peak in total matches at the 
level of 54% for the success ratio rtot = 0.67. Overall, this is 
an unsatisfactory result that comes from a large overlap of 
HEs for short and long GRBs (compare with Fig. 0. 

Using log MVTS similar results are obtained, with an 
even higher frequency of mismatches for short GRBs, equal 
to 82%. On the other hand, it performs slightly better in 
distinguishing long GRBs with an ri on g = 1 in 88% of val¬ 
idation sets. This leads to a peak of 76% at rtot = 0.67, 
which is a considerable accuracy. 

Historically, the distinction between short and 
long GRBs was made based on logTgo distribution 
dKouveliotou et aI.lll993T l. and the classification is straight¬ 
forward: if Tgo < 2s (Tgo > 2s), a GRB is short (long). 
Nevertheless, given that SVM classification is probability- 
based, one can expect a lower than 100% accuracy, especially 
that both short and long GRBs may be arbitrarily close 
to the limit Tgo = 2 s. Indeed, short (long) GRBs were 
properly recognized in 99.95% (91.64%) cases, what gives a 
net accuracy of 91.59% (at rtot = 1). However, as logTgo 
ML is effective for short GRBs, while log MVTS and H for 
long ones, their combination is likely to give more reliable 
results. For both HE and log MVTS distributions, the 

3 There is a total of ^q) x ( 42 ) ~ 3.77 • 10 7 subsets. 
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probabilities favor long GRBs (compare with right column 
in Fig. [4]). 


4-2.2 2D parameter spaces 

As both H and log MVTS distributions exhibit large over¬ 
laps for GRB populations, the results of ML displayed in 
Fig. [5] for a set of pairs (H, log MVTS) are neither surpris¬ 
ing, nor interesting. Short GRBs are falsely classified in 52% 
of cases, while long GRBs are correctly recognized in 79% 
of validation sets. This gives a peak of 53% at rtot = 0.67. 

However, when HEs are complemented with logTgo, 
an accuracy of 100% for long GRBs is attained. For short 
GRBs, an accuracy of 94% is obtained, what results in a to¬ 
tal proper recognition level of also 94%. Note this is a better 
result than for H and log Tgo alone. 

Distributions in log MVTS have larger overlaps than 
in H , hence the performance of logTgo complemented with 
MVTS is slightly worse than in the previous case: success 
ratios equal to unity are obtained in 96% and 93% for short 
and long GRBs, recpectively, and rtot = 1 in 88% cases. Con¬ 
sidering this space as durations complemented with MVTS, 
the accuracy is lower. On the other hand, when MVTS are 
complemented with logTgo, the performance is much better 
than for MVTS alone. Again, when the populations overlap 
strongly, the probabilities favor long GRBs (compare with 
right column in Fig. 01. 


4-2.3 3D parameter spaces 

When ML is performed on a three-dimensional training set 
(see right panel in Fig. 0 of triples (H, log MVTS, log Tgo), 
short GRBs are classified properly in 95.18%, long GRBs 
in 99.99%, and the overall accuracy is higher than 95%. 
Comparing with ID and 2D approaches, this gives a 
significant improvement (see Fig. 0. Specifically, when 
(logTgo, log MVTS) is complemented with HEs, the gain 
in accuracy is almost 7%. It is not surprising, however, that 
complementing (H, log MVTS) with durations gives a sig¬ 
nificant improvement, as Tgo allows to introduce a clear sep¬ 
aration between the populations, which are highly mixed 
otherwise. Moreover, adding log MVTS to (H, logTgo) gives 
an improvement of about 1.5%. 


5 DISCUSSION 


Unsupervised ML gives hints about the number of classes in 
a dataset. This is, however, a phenomenological approach 
that does not have to reflect differences in physical pro¬ 
cesse s underlying their orig in dHorvath et alJ 120061 : iNakarl 
120071 ; [Bromberg et al.l [2013T) . Given the small dataset with 
computed HEs, a supervised ML was performed in order to 
verify whether the feat ures recently introduced to character- 
ize GRBs, i.e., MVTS dBhat 201 ^MucLachlan et alJl2012j . 
l2013al ) and HE dMacLachlan et all 201 31)1) . are suitable for 
automated classification. As the commonly accepted division 
of GRBs into short and long according to Tgo is its key, based 
on which the classes for an SVM classifier were attributed 
to available MVTS and HE values, a three-dimensional pa¬ 
rameter space (H, log MVTS, log Tgo) was studied. It was 
shown that the accuracy was much greater in this parameter 
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Figure 4. Frequencies of successful univariate predictions. Rows , from upper: based on HEs, on MVTS, and on durations Tgo. Columns , 
from left: fraction of successes for short GRBs, for long GRB, and for a union of both. Right panel displays examples of probability 
functions learnt by SVM. Dashed green - short GRBs, solid red - long GRBs. Dots mark the training set: green (upper row) - short 
GRBs, red (bottom row) - long GRBs. 


space than in a one- or two-dimensional parameter spaces. 
Especially, complementing (log MVTS, log Tgo) by H lead 
to a better performance. Therefore, the HE is considered a 
candidate indicator for distinguishing short and long GRBs, 
however its performance alone was rather poor. On the other 
hand, a sample of 22 + 46 = 68 GRBs with calculated HEs 
is not numerous enough to adjudicate with certainty what 
is the genuine overlap for the whole population. 

Another issue is to construct the training set; specif¬ 
ically, to unambiguously attribute classes to the objects 
observed. The conservative limit of Tgo = 2s was shown 
to be appropriate for BATSE GRBs, but for Swift data a 
divis ion at Tgo = 0.8 s is more suitable jBromberg et al.l 
120131 ). The plausible limits were derived (by means of 
a local minimum of a bimodal distr ibution of T g o dura - 
tions) for a number of catalogs by iTarnopolskil d2015d ) 
and gave values in the range 2.05 s to 3.38 s, hence 
the classification is detector dependent Note that the 


short and long associations for the c urrent study were 
taken from MacLachlan et aTl d2013allb l): Golkhou fc Butler) 
d2014l) : lGolkhou. Butler &: Littlejohns! d2015l) . and are based 
on the classical limit of 2 s. Moreover, GRBs classified 
as short often are of collapsar origin and those classi¬ 
fied as l ong are recognized as having non- collapsar pro- 


_ng non- cc 

genitors dNakaid 120071 : Bromberg et alJT2013h . Other prop¬ 
erties commonly used as indicators a re the hardness ra¬ 
tio, spectral lag, peak energy etc. dBagolv et all 1 19981 : 
iBorgonovo fo Bi5rnssonll2006l : IZhang et ah 20091) . but they 
are still selected phenomenologically. It was also suggested 
that GRBs may be ge nerated b y a third mechani sm, most 
likely shock break out dBromberg. Nakar fe Piranll201li ). 

It was shown in a number of research, for BATSE data 
dHonq|tlijst_8jJ|20Q6j) as we ll as for Swift sample dVeres et aid 
I 2 OIOI : Horvath et aid l20ld) . that while the GRB separa¬ 
tion by duration or hardness ratio alone is ambiguous, the 


4 It appears that the Fermi dataset has a limit of 2.05 s 


(jTimojgolskJI^OHicjj^ very c lose to the conventional limit derived 
bv lKouveliotou et al.l (I1993T ). 
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Success Ratb 


Success Ratb 


Success Ratb 


Figure 5. Frequencies of successful bivariate predictions. Rows , from upper: based on HEs and MVTS, on HEs and durations T 90 , and 
on MVTS and durations T 90 . Columns , from left: successes for short GRBs, for long GRB, and for a union of both. Right panel displays 
examples of probability functions learnt by SVM. Green surface - short GRBs, red surface - long GRBs. Dots mark the training set: 
blue - short GRBs, black - long GRBs. 





Figure 6. Frequencies of successful trivariate predictions. From left: short GRBs, long GRBs, and a union of both. Right panel displays 
an example training set for a trivariate machine learning. Blue dots - short GRBs, black dots - long GRBs. 
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joint distribution in a two-dimensional space of log H 32 vs. 
logTgo allows a sharper differentiation between the GRB 
types. I11 the spirit of these findings, and in light of the 
proposition that HEs ma y serve as a GRB type indicator 
jMacLachlan et alJl2013bf ). it was verified that complement¬ 
ing the sets of logTgo and (log Tgo, log MINTS') by HEs in¬ 
creases the success rate defined in Eq. ([8|l. The fact that HE 
and MVTS alone, i.e. without T90, are less convenient is not 
fully new, but the main result herein is that HEs help in 
distinguishing GRBs of different types. 

SVM performs well in distinguishing short and long 
GRBs with data complemented by HEs. After setting the 
number of physically different GRB classes (at least two, 
and possibly three), which will require a working model that 
may hopefully predict the distributions (univariate, i.e. du¬ 
rations, hardness ratio, etc., as well as multivariate) of char¬ 
acteristic features, a supervised ML might then be applied 
to automated classification of GRBs observed in the future. 


6 CONCLUSIONS 

In this paper, GRBs observed by Fermi were investigated. 
Among ~ 1600 GRBs, there are 68 GRBs with calculated 
HE values, and a supervised ML was performed on this sub¬ 
sample. The SVM method is applied to singles, pairs and 
triples of (H, log MVTS, log Tg 0 ). The HE and MVTS (pa¬ 
rameters new in GRB classification) alone perform rather 
poor in distinguishing between short and long GRBs due 
to a large overlap in their distributions. However, when the 
pairs (log MVTS, log T90) are complemented by HEs, the 
accuracy of classification increases by 7% and exceeds 95%, 
i.e., introduction of H in the classification scheme gives more 
accurate results. Hence, HEs are suggested as indicator can¬ 
didates for distinguishing short and long GRBs, however 
not alone, as it is commonly done with durations T90, but 
as part of an association of parameters. Particularly, in this 
paper an association of three parameters (H, MVTS, T90) 
was examined. 

It is suggested that HEs will be useful in classifying 
GRBs after the number of their classes is unambiguously de¬ 
termined, possibly by constructing a working model rather 
than by a phenomenological approach. As the shape of the 
HE distribution could not be determined accurately due to 
a small sample (consisting of 68 GRBs), a bigger, preferably 
complete, sample of Fermi GRBs might reveal new prop¬ 
erties of the GRB population that could either challenge 
present theoretical models, or give hints on how to construct 
them. 
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